valr: Reproducible genome interval analysis in R

نویسندگان

  • Kent A Riemondy
  • Ryan M Sheridan
  • Austin Gillen
  • Yinni Yu
  • Christopher G Bennett
  • Jay R Hesselberth
چکیده

New tools for reproducible exploratory data analysis of large datasets are important to address the rising size and complexity of genomic data. We developed the valr R package to enable flexible and efficient genomic interval analysis. valr leverages new tools available in the "tidyverse", including dplyr. Benchmarks of valr show it performs similar to BEDtools and can be used for interactive analyses and incorporated into existing analysis pipelines.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparison of Lower Extremity Joint Work and Initial Loading Rates among Four Different Running Styles

The annual injury incidence rate among the 36 million runners in the United States is approximately 50%. The majority of those injuries occur at the knee joint. The purpose of this study was to measure lower extremity eccentric work and average vertical loading rates among four groups of runners. Seventy-four healthy runners volunteered for this study. The four groups of runners consisted of tr...

متن کامل

Genome-wide Association Study to Identify Genes and Biological Pathways Associated with Type Traits in Cattle using Pathway Analysis

Extended Abstract Introduction and Objective: Type traits describing the skeletal characteristics of an animal are moderately to strongly genetically correlate with other economically important traits in cattle including fertility, longevity and carcass traits. The present study aimed to conduct a genome wide association studies (GWAS) based on gene-set enrichment analysis for identifying the ...

متن کامل

mmgenome: a toolbox for reproducible genome extraction from metagenomes

Summary: Recovery of population genomes is becoming a standard analysis in metagenomics and a multitude of different approaches exists. However, the workflows are complex, requiring data generation, binning, validation and finishing to generate high quality population genome bins. In addition, several different approaches are often used on the same dataset as the optimal strategy to extract a s...

متن کامل

INTERVAL ANALYSIS-BASED HYPERBOX GRANULAR COMPUTING CLASSIFICATION ALGORITHMS

Representation of a granule, relation and operation between two granules are mainly researched in granular computing. Hyperbox granular computing classification algorithms (HBGrC) are proposed based on interval analysis. Firstly, a granule is represented as the hyperbox which is the Cartesian product of $N$ intervals for classification in the $N$-dimensional space. Secondly, the relation betwee...

متن کامل

Interval linear regression

‎In this paper‎, ‎we have studied the analysis an interval linear regression model for fuzzy data‎. ‎In section one‎, ‎we have introduced the concepts required in this thesis and then we illustrated linear regression fuzzy sets and some primary definitions‎. ‎In section two‎, ‎we have introduced various methods of interval linear regression analysis‎. ‎In section three‎, ‎we have implemented nu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2017